<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<html>
<head>
<link rel="STYLESHEET" href="lib.css" type='text/css' />
<link rel="SHORTCUT ICON" href="../icons/pyfav.png" type="image/png" />
<link rel='start' href='../index.html' title='Python documentation Index' />
<link rel="first" href="lib.html" title='Python library Reference' />
<link rel='contents' href='contents.html' title="Contents" />
<link rel='index' href='genindex.html' title='Index' />
<link rel='last' href='about.html' title='About this document...' />
<link rel='help' href='about.html' title='About this document...' />
<link rel="next" href="differ-objects.html" />
<link rel="prev" href="sequence-matcher.html" />
<link rel="parent" href="module-difflib.html" />
<link rel="next" href="differ-objects.html" />
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<meta name='aesop' content='information' />
<title>4.4.2 SequenceMatcher Examples </title>
</head>
<body>
<div class="navigation">
<div id='top-navigation-panel' xml:id='top-navigation-panel'>
<table align="center" width="100%" cellpadding="0" cellspacing="2">
<tr>
<td class='online-navigation'><a rel="prev" title="4.4.1 sequencematcher Objects"
  href="sequence-matcher.html"><img src='../icons/previous.png'
  border='0' height='32'  alt='Previous Page' width='32' /></a></td>
<td class='online-navigation'><a rel="parent" title="4.4 difflib  "
  href="module-difflib.html"><img src='../icons/up.png'
  border='0' height='32'  alt='Up one Level' width='32' /></a></td>
<td class='online-navigation'><a rel="next" title="4.4.3 differ Objects"
  href="differ-objects.html"><img src='../icons/next.png'
  border='0' height='32'  alt='Next Page' width='32' /></a></td>
<td align="center" width="100%">Python Library Reference</td>
<td class='online-navigation'><a rel="contents" title="Table of Contents"
  href="contents.html"><img src='../icons/contents.png'
  border='0' height='32'  alt='Contents' width='32' /></a></td>
<td class='online-navigation'><a href="modindex.html" title="Module Index"><img src='../icons/modules.png'
  border='0' height='32'  alt='Module Index' width='32' /></a></td>
<td class='online-navigation'><a rel="index" title="Index"
  href="genindex.html"><img src='../icons/index.png'
  border='0' height='32'  alt='Index' width='32' /></a></td>
</tr></table>
<div class='online-navigation'>
<b class="navlabel">Previous:</b>
<a class="sectref" rel="prev" href="sequence-matcher.html">4.4.1 SequenceMatcher Objects</a>
<b class="navlabel">Up:</b>
<a class="sectref" rel="parent" href="module-difflib.html">4.4 difflib  </a>
<b class="navlabel">Next:</b>
<a class="sectref" rel="next" href="differ-objects.html">4.4.3 Differ Objects</a>
</div>
<hr /></div>
</div>
<!--End of Navigation Panel-->

<h2><a name="SECTION006420000000000000000"></a><a name="sequencematcher-examples"></a>
<br>
4.4.2 SequenceMatcher Examples 
</h2>

<p>
This example compares two strings, considering blanks to be ``junk:''

<p>
<div class="verbatim"><pre>
&gt;&gt;&gt; s = SequenceMatcher(lambda x: x == " ",
...                     "private Thread currentThread;",
...                     "private volatile Thread currentThread;")
</pre></div>

<p>
<tt class="method">ratio()</tt> returns a float in [0, 1], measuring the similarity
of the sequences.  As a rule of thumb, a <tt class="method">ratio()</tt> value over
0.6 means the sequences are close matches:

<p>
<div class="verbatim"><pre>
&gt;&gt;&gt; print round(s.ratio(), 3)
0.866
</pre></div>

<p>
If you're only interested in where the sequences match,
<tt class="method">get_matching_blocks()</tt> is handy:

<p>
<div class="verbatim"><pre>
&gt;&gt;&gt; for block in s.get_matching_blocks():
...     print "a[%d] and b[%d] match for %d elements" % block
a[0] and b[0] match for 8 elements
a[8] and b[17] match for 6 elements
a[14] and b[23] match for 15 elements
a[29] and b[38] match for 0 elements
</pre></div>

<p>
Note that the last tuple returned by <tt class="method">get_matching_blocks()</tt> is
always a dummy, <code>(len(<var>a</var>), len(<var>b</var>), 0)</code>, and this is
the only case in which the last tuple element (number of elements
matched) is <code>0</code>.

<p>
If you want to know how to change the first sequence into the second,
use <tt class="method">get_opcodes()</tt>:

<p>
<div class="verbatim"><pre>
&gt;&gt;&gt; for opcode in s.get_opcodes():
...     print "%6s a[%d:%d] b[%d:%d]" % opcode
 equal a[0:8] b[0:8]
insert a[8:8] b[8:17]
 equal a[8:14] b[17:23]
 equal a[14:29] b[23:38]
</pre></div>

<p>
See also the function <tt class="function">get_close_matches()</tt> in this module,
which shows how simple code building on <tt class="class">SequenceMatcher</tt> can be
used to do useful work.

<p>

<div class="navigation">
<div class='online-navigation'>
<p></p><hr />
<table align="center" width="100%" cellpadding="0" cellspacing="2">
<tr>
<td class='online-navigation'><a rel="prev" title="4.4.1 sequencematcher Objects"
  href="sequence-matcher.html"><img src='../icons/previous.png'
  border='0' height='32'  alt='Previous Page' width='32' /></a></td>
<td class='online-navigation'><a rel="parent" title="4.4 difflib  "
  href="module-difflib.html"><img src='../icons/up.png'
  border='0' height='32'  alt='Up one Level' width='32' /></a></td>
<td class='online-navigation'><a rel="next" title="4.4.3 differ Objects"
  href="differ-objects.html"><img src='../icons/next.png'
  border='0' height='32'  alt='Next Page' width='32' /></a></td>
<td align="center" width="100%">Python Library Reference</td>
<td class='online-navigation'><a rel="contents" title="Table of Contents"
  href="contents.html"><img src='../icons/contents.png'
  border='0' height='32'  alt='Contents' width='32' /></a></td>
<td class='online-navigation'><a href="modindex.html" title="Module Index"><img src='../icons/modules.png'
  border='0' height='32'  alt='Module Index' width='32' /></a></td>
<td class='online-navigation'><a rel="index" title="Index"
  href="genindex.html"><img src='../icons/index.png'
  border='0' height='32'  alt='Index' width='32' /></a></td>
</tr></table>
<div class='online-navigation'>
<b class="navlabel">Previous:</b>
<a class="sectref" rel="prev" href="sequence-matcher.html">4.4.1 SequenceMatcher Objects</a>
<b class="navlabel">Up:</b>
<a class="sectref" rel="parent" href="module-difflib.html">4.4 difflib  </a>
<b class="navlabel">Next:</b>
<a class="sectref" rel="next" href="differ-objects.html">4.4.3 Differ Objects</a>
</div>
</div>
<hr />
<span class="release-info">Release 2.5.1, documentation updated on 18th April, 2007.</span>
</div>
<!--End of Navigation Panel-->
<address>
See <i><a href="about.html">About this document...</a></i> for information on suggesting changes.
</address>
</body>
</html>
